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ORIGINAL ARTICLE 

A score to estimate the likelihood of detecting 
advanced colorectal neoplasia at colonoscopy 

Michal F Kaminski, 1,2 Marcin Polkowski, 1,2 Ewa Kraszewska, 2 Maciej Rupinski, 1,2 
Eugeniusz Butruk, 3 Jaroslaw Regula 1,2 



ABSTRACT 

Objective This study aimed to develop and validate a 
model to estimate the likelihood of detecting advanced 
colorectal neoplasia in Caucasian patients. 
Design We performed a cross-sectional analysis of 
database records for 40-year-old to 66-year-old patients 
who entered a national primary colonoscopy-based 
screening programme for colorectal cancer in 73 centres 
in Poland in the year 2007. We used multivariate logistic 
regression to investigate the associations between 
clinical variables and the presence of advanced 
neoplasia in a randomly selected test set, and confirmed 
the associations in a validation set. We used model 
coefficients to develop a risk score for detection of 
advanced colorectal neoplasia. 
Results Advanced colorectal neoplasia was detected in 
2544 of the 35 918 included participants (7.1%). In the 
test set, a logistic-regression model showed that 
independent risk factors for advanced colorectal 
neoplasia were: age, sex, family history of colorectal 
cancer, cigarette smoking (p<0.001 for these four 
factors), and Body Mass Index (p=0.033). In the 
validation set, the model was well calibrated (ratio of 
expected to observed risk of advanced neoplasia: 1.00 
(95% CI 0.95 to 1.06)) and had moderate discriminatory 
power (c-statistic 0.62). We developed a score that 
estimated the likelihood of detecting advanced neoplasia 
in the validation set, from 1.32% for patients scoring 0, 
to 19.12% for patients scoring 7-8. 
Conclusions Developed and internally validated score 
consisting of simple clinical factors successfully estimates 
the likelihood of detecting advanced colorectal neoplasia 
in asymptomatic Caucasian patients. Once externally 
validated, it may be useful for counselling or designing 
primary prevention studies. 



INTRODUCTION 

The strength of evidence regarding the efficacy of 
colorectal cancer screening in reducing the inci- 
dence of colorectal cancer and associated mortality 
is increasing. 1 2 Colorectal cancer screening is cur- 
rently recommended in the European Union 3 ; 
however, adherence to this recommendation is not 
sufficient. ~ One of the most important barriers to 
screening is a lack of perceived risk of colorectal 
cancer among average-risk patients and primary 
care providers. 7 8 The risk of colorectal cancer or 
advanced colorectal neoplasia varies with regard to 
several factors, including age, 9-11 sex, 10-12 family 
history of colorectal cancer, 10 13 smoking, 14 15 
obesity, 11 16 diabetes mellitus, 17 long-term non- 
steroid anti-inflammatory drug use, 15 18 diet 15 19 



Significance of this study 



What is already known about this subject? 

► Colorectal cancer screening is currently 
recommended for average-risk population older 
than 50 years of age, but the adherence to this 
recommendation is generally not sufficient. 

► One of the most important barriers to screening 
is low perceived risk of colorectal cancer 
among average-risk patients and primary care 
providers. 

► Advanced colorectal neoplasia is not evenly 
distributed throughout the 'average-risk' 
screening population. 

What are the new findings? 

► The new score uses age, sex, family history of 
colorectal cancer, cigarette smoking and Body 
Mass Index to estimate the likelihood of 
detecting advanced colorectal neoplasia in 
asymptomatic Caucasian patients. 

► The score estimated the likelihood of detecting 
advanced colorectal neoplasia in so-called 
'average-risk' asymptomatic Caucasian patients 
from 1.3% to 19.1%. 

How might it impact on clinical practice in 
the foreseeable future? 

► The score may increase colorectal cancer risk 
awareness and help healthcare providers to 
encourage people to get screened by adhering 
to existing national screening programmes. 

► By identifying patients at high likelihood of 
detecting advanced colorectal neoplasia, the 
score may help to target primary prevention 
interventions. 



and physical activity. 15 16 Information about some 
of these factors is easy to obtain and could be used 
to indentify patients at high-average risk of 
advanced colorectal neoplasia who are likely to 
benefit the most from screening. This high-average 
risk population should be the target of most inten- 
sive participation improvement interventions and 
primary prevention studies. 

We performed a cross-sectional analysis of data 
from a national colonoscopy screening programme 
to derive and validate a risk prediction model for 
detection of advanced colorectal neoplasia. The 
results of the model were used to develop a simple 
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scoring system that estimates the likelihood of detecting 
advanced colorectal neoplasia in asymptomatic patients. 

METHODS 

Study design and oversight 

We performed a cross-sectional analysis of database records for 
40-year-old to 66-year-old patients who entered the national 
colonoscopy screening programme for colorectal cancer in 
Poland, from January 2007 through December 2007. The data- 
base contained demographic data, colonoscopy and histopath- 
ology results, follow-up information, and the results of an 
epidemiological questionnaire on potential risk factors for 
advanced colorectal neoplasia from 73 screening centres 
throughout Poland. 

The research proposal was reviewed by the Research Ethical 
Committee at the authors' institution and was judged to be 
exempt from oversight. Written informed consent was obtained 
from all participants entering the National Colorectal Cancer 
Screening Program. 

Study population 

Patients between the ages of 50 years and 66 years (40 years 
and 66 years in case of positive family history of cancer of any 
type) were advised by their family or general practitioners to 
participate in the screening. Exclusion criteria were clinical sus- 
picion of colorectal cancer; characteristics that met the criteria 
for Lynch syndrome, familial adenomatous polyposis, or inflam- 
matory bowel disease; and colonoscopy within the preceding 
10 years. For this study, we excluded patients who had 
screening-detected polyps 10 mm or larger that were not 
removed (hence histology was unavailable) and patients who 
had not fully completed the epidemiological questionnaire. 

Study procedures and definitions 

Patients eligible for screening were asked to complete the epi- 
demiological questionnaire regarding the following potential 
risk factors for advanced colorectal neoplasia: age, 9-11 sex, 10-12 
weight and height (to calculate Body Mass Index), 11 16 family 
history of colorectal cancer in first-degree relatives, 10 13 diabetes 
mellitus, 17 smoking history (number of years of smoking, 
number of cigarettes smoked per day, and current smoking 
status), 14 15 and regular aspirin use (use for at least 3 months at 
any dose). 15 18 Information about physical activity, diet, other 
than aspirin non-steroid anti-inflammatory drug use and alcohol 
consumption were not collected. 

Screening colonoscopy procedures have been described else- 
where. 10 All screening colonoscopists and histopathologists par- 
ticipated in the quality assurance programme. 20 Colorectal 
findings were categorised on the basis of the most advanced 
lesion identified at screening (including additional required col- 
onoscopies to remove all polyps, when indicated). 9 10 Advanced 
neoplasia was defined as cancer or adenoma that was at least 
10 mm in diameter, had high-grade dysplasia, had villous or 
tubulovillous histologic characteristics, or any combination 
thereof. 9 10 For the purpose of the analysis, traditional serrated 
adenomas, sessile serrated lesions, and mixed serrated polyps 
were categorised as tubular adenomas. Polyps<10mm in size 
that were not removed or retrieved were categorised as 
non-neoplastic. 

The following predefined categories of variables were used to 
analyse the risk factors for detecting advanced neoplasia: age 
(40-49, 50-54, 55-59, or 60-66 years), sex, family history of 
colorectal cancer (none, one first-degree relative >60 years 
of age with colorectal cancer, one first-degree relative <60 years 



of age with colorectal cancer, or two first-degree relatives with 
colorectal cancer), pack-years smoked (none, <10, 10-19, or 
>20 pack-years), diabetes mellitus (yes or no), Body Mass Index 
(<25, 25-29, or >30 kg/m 2 ), and regular aspirin use (yes or 
no). We performed a sensitivity analysis using age as a continu- 
ous variable and compared its discriminatory power with the 
model using age as a categorised variable. 

Statistical analysis 

The original dataset was randomly partitioned in a 1:1 ratio to 
generate a test set and a validation set, while controlling for the 
distribution of the most advanced lesions. 21 A multivariate logis- 
tic regression model was used to investigate the relation 
between clinical variables and the presence of advanced neopla- 
sia in the test set. 22 The likelihood ratio test was used to deter- 
mine a significant association of a particular variable with the 
presence of advanced neoplasia and the interaction between 
variables. For statistically significant effects, the OR and 95% CI 
were reported for each predefined category of variables. The 
model was internally validated using the validation set. The 
Hosmer-Lemeshow test was used to check the goodness-of-fit 
of the models. 

The calibration of the model was assessed using the validation 
set by comparing the expected and observed numbers of 
patients with advanced neoplasia, overall and for each category 
of variables. 23 Homogeneous participant groups were defined 
by all combinations of categories of significant predictors. The 
expected number of patients with advanced neoplasia for each 
homogeneous group of study participants was calculated by 
summing the estimated individual absolute risk predicted by the 
model developed on the test set. The 95% CIs for the expected 
to observed ratio were calculated by using normal approxima- 
tions to Poisson distributions. 

The concordance statistic was used to measure models' dis- 
crimination among patients with and without advanced neopla- 
sia. For binary logistic regression models, the concordance 
statistic is equivalent to the area under the receiver-operating 
characteristic curve. 

The results of the multivariate logistic regression model were 
used to develop a risk score for detecting advanced neoplasia in 
asymptomatic patients. Model-adjusted coefficients were 
rounded up to the nearest one-half integer and then multiplied 
by two to avoid decimals. 11 The performance of the risk score 
was assessed in the validation set using the concordance statistic. 

A p value <0.05 was considered statistically significant. All 
reported p values are two-sided and not adjusted for multiple 
testing. The analyses were performed using Stata Statistical 
Software, V10 (Stata Corporation, College Station, Texas, USA). 

RESULTS 

Study participants 

Of the 39 265 patients who met the eligibility criteria and 
underwent colonoscopy in one of the 73 screening centres 
between January and December 2007, 3347 (8.5%) were 
excluded, due to incomplete questionnaire feedback (3242 
screened participants, 8.3%) or polyps measuring >10 mm that 
were not removed (105 screened participants, 0.3%). 

The remaining 35 918 patients, 22 164 women (61.7%) and 
13 754 men (38.3%), all Caucasians, had a mean age of 55.6 
±5.2 years. Of the 35 918 patients, 6897 (19.2%) had a family 
history of colorectal cancer, 15 678 (43.7%) had history of 
smoking, 1440 (4.0%) had diabetes mellitus, 7931 (22.1%) had 
a Body Mass Index >35 kg/m 2 , and 4623 (12.9%) reported 
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regular use of aspirin. The characteristics of the study popula- 
tion are summarised in table 1. 



Colonoscopy results 

Colonoscopy was completed to the caecum in 34 469 patients 
(96.0%). A total of 6909 patients (19.2%) had an adenoma or 
cancer. A total of 232 patients (0.6%) had polyps <10mm in 
size that were not removed or retrieved, hence were categorised 
as non-neoplastic abnormalities. Advanced neoplasia was 
detected in 2544 patients (7.1%), including 336 participants 
(0.9%) with adenocarcinoma (table 1). Clinically significant 
complications requiring medical intervention occurred in 42 
patients (0.1%) and included seven cases of perforation (three 
of which occurred after polypectomy), 21 episodes of bleeding, 
nine cardiovascular events, and five other events. No deaths 



occurred as a result of screening colonoscopy or its 
complications. 

Model for the detection of advanced neoplasia 

The test and validation sets consisted of 17 979 and 17 939 
patients, respectively. We built the multivariate logistic regres- 
sion model and used the test set to investigate the predictors of 
detecting advanced neoplasia. The results of the likelihood ratio 
test indicated significant association between the risk of detect- 
ing advanced neoplasia and the following variables: age, sex, 
family history of colorectal cancer, cigarette smoking and Body 
Mass Index (table 2). It also revealed significant association of 
the interaction between sex and Body Mass Index and the risk 
of detecting advanced neoplasia. The following insignificant 
variables were reduced from the model: diabetes and regular 
aspirin use (likelihood ratio test, p values equalling 0.24 and 



Table 1 Demographic and colonoscopy characteristics for the 35 918 study participants* 



Characteristic 



All (N=35 918) 



Women (N=22 164) 



Men (N=13 754) 



Age, years 

Range 

Mean (5D) 
Age group, n (%) 

40-49 

50-54 

55-59 

60-66 

Family history of CRC, n (%) 
None 

One first-degree relative >60 years of age with CRC 
One first-degree relative <60 years of age with CRC 
Two first-degree relatives with CRC 
Smoking history, pack-yearst, n (%) 
None 
<10 
10-19 
>20 

Diabetes mellitus, n (%) 
No 

Yes 

BMI (kg/m 2 ), n (%) 
<25 
25-29 
>30 

Regular aspirin use, n (%) 
No 
Yes 

Not available 
Total colonoscopy, n (%) 
Adequate bowel preparation*, n (%) 
Intravenous sedation, n (%) 
Main colonoscopy findings, n (%) 
None 

Non-neoplastic abnormalities 
Non-advanced neoplasia§ 
Advanced neoplasia!! 



40-66 
55.6 (5.2) 

3606 (10.0) 

1 1 000 (30.6) 

12 379 (34.5) 
8933 (24.9) 

29 021 (80.8) 
4798 (13.4) 
1809 (5.0) 
290 (0.8) 

20 240 (56.4) 
4755 (13.2) 
4576 (12.7) 
6347 (17.7) 

34 478 (96.0) 
1440 (4.0) 

1 1 439 (31 .9) 
16 548 (46.1) 
7931 (22.1) 

28 960 (80.6) 
4623 (12.9) 
2335 (6.5) 
34 469 (96.0) 
33 909 (94.4) 
20 850 (58.1) 

14 415 (40.1) 
14 594 (40.6) 
4365 (12.2) 
2544 (7.1) 



40-66 
55.6 (5.1) 

2125 (9.6) 
6818 (30.8) 
7788 (35.1) 
5433 (24.5) 

17 846 (80.5) 
3000 (13.5) 
1121 (5.1) 

197 (0.9) 

13 737 (62.0) 
3291 (14.8) 
2663 (12.0) 
2473 (1 1 .2) 

21 404 (96.6) 
760 (3.4) 

8258 (37.3) 
9190 (41.5) 
4716 (21.3) 

18 110 (81.7) 
2590 (1 1 .7) 
1464 (6.6) 

21 078 (95.1) 
20 942 (94.5) 
13 565 (61.2) 

9601 (43.3) 
9171 (41.4) 
2207 (10.0) 
1185 (5.3) 



40-66 
55.6 (5.3) 

1481 (10.8) 
4182 (30.4) 
4591 (33.4) 
3500 (25.4) 

11 175 (81.2) 
1798 (13.1) 

688 (5.0) 
93 (0.7) 

6503 (47.3) 
1464 (10.6) 
1913 (13.9) 
3874 (28.2) 

13 074 (95.1) 
680 (4.9) 

3181 (23.1) 
7358 (53.5) 
3215 (23.4) 

10 850 (78.9) 
2033 (14.8) 
871 (6.3) 
13 391 (97.4) 

12 967 (94.3) 
7380 (53.7) 

4814 (35.0) 
5423 (39.4) 
2158 (15.7) 
1359 (9.9) 



•Because of rounding, percentages may not total 100. 

tRegardless of current smoking status. 

tBowel preparation was assessed by endoscopists. 

includes 232 patients with polyps <10 mm in size that were not removed or 
HAdvanced neoplasia was defined as a cancer or adenoma that was >10 mm 
BMI, Body Mass Index; CRC, colorectal cancer. 



retrieved. 

in diameter, had high-grade dysplasia, 



had a villous component, or any combination thereof. 
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Table 2 Associations between individual characteristics and advanced colorectal neoplasia in a multivariable analysis and development of the 
score (test set, N=1 7 979) 



Covariates| 


LR test* 


Adjusted OR (95% CI) 


p Value 


Adjusted p coefficient! 


Risk score! 


Age group, years 


<0.001 










40-49 




1 


0.002 


0 


0 


50-54 




1.53 (1.16 to 2.02) 


<0.001 


0.43 


1 


55-59 




2.29 (1.75 to 3.00) 


<0.001 


0.83 


2 


60-66 




3.14 (2.40 to 4.12) 




1.15 


3 


Family history of CRC 


<0.001 










None 




1 


<0.001 


0 


0 


One first-degree relative with CRC, aged >60 years 




1.40 (1.17 to 1.67) 


<0.001 


0.34 


1 


One first-degree relative with CRC, aged <60 years 




1.66 (1.28 to 2.17) 


0.008 


0.51 


2 


Two first-degree relatives with CRC 




2.11 (1.22 to 3.66) 




0.75 


2 


Sex 


<0.001 










Female 




1 


<0.001 


0 


0 


Male 




2.14 (1.71-2.67) 




0.76 


2 


Smoking history, pack-years§ 


<0.001 










None 




1 


0.654 


0 


0 


<10 




1.05 (0.86 to 1.27) 


0.001 


0 


0 


10-19 




1.34 (1.12 to 1.59) 


<0.001 


0.30 


1 


>20 




1 60 (1 39 to 1 8S> 




0.47 


1 


BMI (kg/m 2 ) 


0.033 










<25 




1 


0.192 


0 


0 


25-29 




1.14 (0.94 to 1.38) 


0.008 


0 


0 


>30 




1.34 (1.08 to 1.67) 


0.081 


0.30 


1 (0)11 


Interaction with male sex** 












25-29 




0.78 (0.59 to 1 .03) 


0.031 


0 




>30 




0.70 (0.51 to 0.97) 




-0.35 





*LR test, the likelihood ratio test for models with and without the specified covariate. 
tNon-significant covariates were assigned p coefficient^. 

tModel-adjusted coefficients were rounded up to the nearest one-half integer and then multiplied by two to avoid decimals. 

§Regardless of current smoking status. 

HI point for female sex and 0 point for male sex. 

"Significant interaction was shown only between male sex and BMI. 

BMI, Body Mass Index; CRC, colorectal cancer; LR, likelihood ratio test. 



0.95, respectively). Table 2 depicts the ORs and 95% CI for 
each category of a significant variable. Tests for goodness-of-fit 
of the models in the test and validation datasets permitted 
acceptance of the fit (p values equalling 0.74 and 0.16, 
respectively). 

The results of the model calibration performed in the valid- 
ation dataset are shown in table 3. The ratio of expected to 
observed risk of advanced neoplasia was 1.00 (95% CI 0.95 to 
1.06) overall, 1.03 (95% CI 0.97 to 1.12) in women, and 0.98 
(95% CI 0.91 to 1.06) in men, indicating good calibration. The 
concordance statistics of the model were 0.64 for the test set 
and 0.62 for the validation set, indicating moderate discrimin- 
ation. A sensitivity analysis performed in a test set, with age as 
continuous variable showed comparable concordance statistics 
of the model (0.64, 95% CI 0.63 to 0.66; X 2 P value = 0.82). 

The score to predict detection of advanced colorectal 
neoplasia 

The adjusted p coefficients of the logistic regression model fitted 
on the test set were used to develop the risk score by estimating 
the likelihood of detecting advanced neoplasia for each category 
of significant factors (see table 2). The scores for Body Mass 
Index >30 kg/m 2 for different sexes were adjusted according to 
the interaction coefficient. The score calculated for each person 
from the validation set estimated the likelihood of detecting 
advanced neoplasia from 1.32% for patients with a score of 0 to 



19.12% for patients with scores of seven and eight (figure 1). 
The performance characteristics of the score in the validation set 
are shown in table 4. The concordance statistic for the simplified 
score in the validation set was 0.62 (95% CI 0.60 to 0.64); the 
course of the receiver operating characteristcs curve is shown in 
online supplementary figure SI. Online supplementary table SI 
depicts the ratio of expected to observed risk for advanced colo- 
rectal neoplasia in the validation set by simplified score. 

DISCUSSION 

Our previous study found that male sex, age of 50 years or 
more and family history of colorectal cancer were independent 
risk factors for detecting advanced colorectal neoplasia. 1 In the 
ensuing discussion, it has been suggested that the observed dis- 
parity of advanced neoplasia risk between men and women 
might have merely reflected sex-based differences in smoking 
patterns. 25 In the present study, we used a new dataset to derive 
and validate a model for the detection of advanced colorectal 
neoplasia that included smoking status and other potential con- 
founders, such as age, sex, family history of colorectal cancer, 
and Body Mass Index. We confirmed previously identified asso- 
ciations, and also found that smoking >10 pack-years, and Body 
Mass Index >30 kg/m 2 were independent risk factors for detect- 
ing advanced colorectal neoplasia. Our study corroborated pre- 
viously identified risk factors for advanced colorectal 
neoplasia; 10-16 it also, for the first time, combined all five 



Kaminski MF, era/. Gut 2014;63:1112-1119. doi:10.1 1 36/gutjnl-201 3-304965 



1115 



Endoscopy 



Table 3 Ratio of expected to observed risk for advanced colorectal neoplasia in the validation dataset (N=1 7 939) 


Variables 


Observed risk 


Expected risk 


Expected to observed ratio 


(95% CI) 


Overall 


1270 


1275.3 


1.00 


0.95 to 1.06 


Sex 










Female 


582 


601.5 


1.03 


0.95 to 1.12 


Male 


688 


673.8 


0.98 


0.91 to 1 .06 


Age group, years 










40-49 


83 


76.2 


0.92 


0.74 to 1.14 


50-54 


331 


280.1 


0.85 


0.76 to 0.94 


55-59 


443 


468.6 


1.06 


0.96 to 1.16 


60-66 


413 


450.4 


1.09 


0.99 to 1 .20 


Family history of CRC 










None 


1028 


1010.2 


0.98 


0.92 to 1 .05 


One first-degree relative with CRC, aged >60 years 


170 


176.9 


1.04 


0.90 to 1.21 


One first-degree relative with CRC, aged <60 years 


57 


69.6 


1.22 


0.94 to 1.58 


Two first-degree relatives with CRC 


15 


18.6 


1.24 


0.75 to 2.06 


Smoking history, pack-years* 










None 


582 


609.5 


1.05 


0.97 to 1.14 


<10 


164 


141.9 


0.87 


0.74 to 1 .01 


10-19 


200 


185.7 


0.93 


0.81 to 1.07 


>20 


324 


338.2 


1.04 


0.94 to 1.16 


Women BMI, kg/m 2 










<25 


219 


193.3 


0.88 


0 77 tn 1 01 


25-29 


234 


256.5 


1.10 


0.96 to 1 .25 


>30 


129 


151.6 


1.18 


0.99 to 1.40 


Men BMI, kg/m 2 










<25 


161 


158.7 


0.99 


0.85 to 1.15 


25-29 


331 


344.1 


1.04 


0.93 to 1.16 


>30 


196 


171.0 


0.87 


0.76 to 1.00 



"Regardless of current smoking status. 

BMI, Body Mass Index; CRC, colorectal cancer. 



important factors and their categories in a multivariate analysis 
and confirmed obtained results in a validation set. 

The present model was well calibrated overall, as well as in 
men and women, as verified in the validation set, which means 
that the observed risk of advanced colorectal neoplasia well 
fitted the expected risk. Therefore, we used the model to 
develop a simple score for the detection of advanced colorectal 



.2- 




0 1 2 3 4 5 67 or 8 



Figure 1 The prevalence of advanced colorectal neoplasia or cancer 
in the complete dataset (N=35 918) by simplified score. Advanced 
neoplasia includes advanced adenomas and cancers. Due to small 
sample sizes, risk scores 7 and 8 are presented together. 



neoplasia in asymptomatic patients. The score, based on age, 
sex, family history of colorectal cancer, smoking status and 
Body Mass Index, estimated the likelihood of detecting 
advanced colorectal neoplasia in the validation set from 1.32% 
to 19.12% in patients with 0 to 7-8 points, respectively. The 
estimation of individual risk of detecting advanced colorectal 
neoplasia may help asymptomatic patients and healthcare provi- 
ders to make informed decisions about screening. 26 For 
example, the likelihood of detecting advanced colorectal neopla- 
sia in a 5 3 -year-old, overweight, never-smoking woman, with 
one first-degree relative 60 years of age or older with colorectal 
cancer, is difficult to compare with that of a 56-year-old man 
who smoked for 20 pack-years, but has healthy weight and no 
family history of colorectal cancer. However, based on the 
results of the present model, the respective likelihood of detect- 
ing advanced neoplasia for two such patients are 4.65% and 
12.46% (or 4.57% and 11.27%, respectively, using simplified 
scoring). Such results do not mean that one should discourage 
the woman from participation in an existing screening pro- 
gramme in a given country aiming at average risk group; rather 
they indicate that the man should be specifically encouraged to 
be screened, because the likelihood of detecting advanced neo- 
plasia in his colorectum is almost twice that of the average 
screening population. For ease of clinical application, the 
present model could be transformed into an online calculator of 
the likelihood of detecting advanced colorectal neoplasia and 
used in mobile easy access media. Although lack of symptoms 
and low perceived risk of colorectal cancer are considered major 
barriers to screening, 7 8 it is unknown, whether providing the 
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Table 4 Performance characteristics of the score in the validation set (N=17 939) 



Advanced neoplasi* 



Risk score 


Persons n 


N 


Per cent 


Sensitivity, % 


Specificity, % 


Positive predictive value, % 


Negative predictive value, % 


0 


152 


2 


1.32 


100.00 


0.00 


7.08 


- 


1 


2259 


95 


4.21 


99.84 


0.90 


7.13 


98.68 


2 


3608 


165 


4.57 


92.36 


13.88 


7.55 


95.98 


3 


4626 


284 


6.14 


79.37 


34.54 


8.46 


95.65 


4 


3745 


290 


7.74 


57.01 


60.59 


9.93 


94.87 


5 


2432 


274 


11.27 


34.17 


81.31 


12.23 


94.19 


6 


981 


134 


13.66 


12.60 


94.26 


14.32 


93.40 


7 and 8t 


136 


26 


19.12 


2.05 


99.34 


19.12 


93.01 


Total 


17 939 


1270 


7.08 











'Advanced neoplasia includes advanced adenomas and cancers. 
tDue to small sample risk scores 7 and 8 are presented in one row. 



estimate of individual risk could facilitate the informed decision 
to undergo endoscopy screening in a similar way as it worked 
for prostate cancer screening. 27 It is particularly unknown, what 
kind of effect on participation in screening, would have the 
lower than average estimate of likelihood of detecting advanced 
colorectal neoplasia. 

Another potential application of a model for the detection of 
advanced colorectal neoplasia is to guide practical recommenda- 
tions for mass screening; however, this application would 
require a model with high discriminatory power. 28 The present 
model had only moderate concordance statistic value, compar- 
able to that of previously published models for the detection of 
advanced colorectal neoplasia in Western populations, 11 2 even 
though the present model included more risk factors than previ- 
ous models did. Three issues may explain this observation. First, 
the model of Betes et al 11 lacked validation, which may have led 
to overestimation of its discriminatory power. Second, the 
models of Betes et al 11 and Lin et al 29 were derived from popu- 
lations with a broader age range, which may have increased 
their discriminatory power, because age is the most powerful 
clinical risk factor for advanced colorectal neoplasia. Third, add- 
itional independent risk factors included in the present model 
were too weak to significantly change its discriminatory 
power. 30 The models for the detection of advanced colorectal 
neoplasia in East Asian populations demonstrated variable dis- 
criminatory power. 31 32 The model by Yeoh et al 31 demon- 
strated discriminatory power comparable to that achieved in 
Western populations, while the model by Cai QC et al 32 demon- 
strated better discrimination. The latter model missed family 
history of colorectal cancer but included various dietary factors, 
which (in contrast to previous studies 15 16 ) showed strong asso- 
ciation with the risk of advanced colorectal neoplasia; however, 
these factors are prone to recall bias. Therefore, it is rather 
unlikely that a model based on simple, reliable clinical factors 
alone would ever have sufficient discriminatory power to limit 
the target population for screening. It is corroborated by the 
results of a very recent study by Tao S et al. 33 The model that 
included risk factors missing in the present study (alcohol con- 
sumption, red meat consumption, ever regular use of non- 
steroidal anti-inflammatory drugs, previous colonoscopy and 
previous detection of polyps), also demonstrated moderate dis- 
criminatory value. 33 On the other hand, indirect comparison 
suggests that the present model's sensitivity and specificity for 
advanced colorectal neoplasia may be comparable with a single 
round guaiac faecal occult blood test (mostly due to poor diag- 
nostic performance of guaiac faecal occult blood test for 



detection of advanced adenomas). 34 35 Although the model has 
considerably lower discriminatory power for advanced colorec- 
tal neoplasia compared to the one reported for faecal immuno- 
chemical tests, 34-37 it has been shown that combining clinical 
risk factors with faecal immunochemical test outcome results in 
improved discrimination. 38 39 Therefore, it is likely that in the 
future the clinical factors identified in the model will be com- 
bined with results of faecal immunochemical test and/or blood- 
based biomarkers to select a target population for colonoscopy. 

Our study has certain notable features. Despite a large sample 
size, we have not identified any statistically significant associ- 
ation between diabetes mellitus or aspirin use and the risk of 
advanced colorectal neoplasia. Although diabetes mellitus is a 
known risk factor for colorectal cancer, 17 its association with 
advanced colorectal neoplasia is less certain. 31 The observed 
lack of association may also be due to the lower-than-expected 
prevalence of diabetes mellitus in the study cohort, 40 which 
likely reflects recall bias or self-selection to opportunistic 
screening. 41 

The lack of a statistically significant association between 
aspirin use and the risk of advanced colorectal neoplasia in our 
study may be due to recall bias or missing data regarding the 
dose and regularity of aspirin use. Moreover, we have not col- 
lected the data on other non-steroid anti-inflammatory drugs, 
which in some studies were analysed together with aspirin. 15 

Advanced neoplasia, not just cancer, was chosen for analysis 
because it has been suggested as the most appropriate target for 
endoscopy screening. 9-12 15 29 31 32 Although some previous 
risk prediction models were developed for cancer alone, 23 42 43 
cancers and advanced neoplasia are surrogate endpoints of 
primary cancer screening endpoint, which is colorectal cancer 
mortality. 44 Early detection and treatment of colorectal cancer is 
associated with a reduction in colorectal cancer mortality, 2 but a 
detection and removal of adenomas, especially advanced ones, 
is associated with additional reduction in colorectal cancer inci- 
dence and mortality. 45 46 Therefore, it is uncertain, whether 
cancer alone or advanced neoplasia is a better endpoint for risk 
prediction models, but the latter may be particularly suited for 
use in endoscopy screening. 

The primary endpoint of our model was advanced neoplasia 
located anywhere in the colorectum, therefore, the risk score is 
not optimised for sigmoidoscopy screening. Nevertheless, we 
built an additional model to investigate risk factors for detecting 
distal advanced neoplasia, using sigmoid-descending colon junc- 
tion as an artificial boundary between distal and proximal 
colon. The model for the detection of distal advanced neoplasia 
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identified the same risk factors and showed comparable discrim- 
inatory power (data not shown). 

The limitations of our study require comment. First, the val- 
idation process was limited because it was performed in a 
dataset that was randomly selected from a population recruited 
in the same setting. The model's performance has not been 
tested outside Poland or in non-Caucasians. Nonetheless, the 
National Colorectal Cancer Screening Program recruited partici- 
pants in 73 centres located in all administrative and geographic 
regions of Poland, and was open free of charge to all eligible 
Polish citizens, providing our study with sociodemographic 
diversity. Moreover, the prevalence of advanced colorectal neo- 
plasia identified in our study was 7.1%, which is within the 
range of values reported in studies performed in the USA 9 15 29 
and Europe. s 10 11 Additionally, the adjusted ORs for detecting 
advanced colorectal neoplasia in various categories of risk 
factors in our study are similar to that reported in previously 
published large studies. 10 31 32 47 48 Notably, in our previous 
study, 10 performed several years before and in different endos- 
copy centres, we used the same key to categorise age, family 
history of colorectal cancer and gender and yielded virtually the 
same adjusted ORs for each category of variables. 

Second, our cohort does not fully cover the recommended 
age range for screening (people aged 67-75 years were not 
included) what may limit the applicability of the results for the 
entire population eligible for screening. 49 On the other hand, by 
including people at the lower age range for screening (people 
aged 40-49 years with family history of cancer), this model may 
help to identify and encourage younger people at considerable 
risk to undergo screening. 

Third, given the cross-sectional design of the study, our risk 
score is suitable only to predict the detection of advanced neo- 
plasia at the present time, and not the future risk of developing 
advanced colorectal neoplasia or dying from colorectal cancer. 

In summary, we derived and internally validated a model that 
predicts the likelihood of detecting advanced colorectal neopla- 
sia in asymptomatic Caucasian patients based on age, sex, 
smoking habits, Body Mass Index, and family history of colorec- 
tal cancer. The results of the model were used to develop a 
simple score that estimates the likelihood of detecting advanced 
colorectal neoplasia. Once externally validated, the score may 
be useful for counselling or designing primary prevention 
studies. 
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